Speech-Music Classification Model Based on Improved Neural Network and Beat Spectrum

نویسندگان

چکیده

A speech-music classification method according to a developed neural system and beat spectrum is proposed achieve accurate of through pre-emphasis, endpoint detection, framing, windowing other steps preprocess collect vocal music signals. After fast Fourier transforms triangle filter processing, the Mel frequency cepstrum coefficient (MFCC) obtained, discrete cosine transform performed obtain signal MFCC characteristic parameters. calculating similarity feature parameters similarity, matrix based on which obtained. The residual structure optimized by adding Swish max-out activation functions, respectively, between convolutional network layers build convolution deepen number layers. connected time series (CTC) used as objective loss function. It applied softmax layer deep optimization for model. pitch input information model realize classification. experiment proves that accuracy design higher than 99%; when iteration reaches 1200, training approaches 0; signal-to-noise ratio 180dB, sensitivity specificity are 99.98% 99.96%, respectively; voice 99%, running 0.48 seconds. has been proven high accuracy, low loss, good special effects, can effectively speech-music.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Music content authentication based on beat segmentation and fuzzy classification

Digital audio has been ubiquitous over the past decade. Since it can be easily modified by editing tools, there has been a strong need to protect its content for secure multimedia applications. Previous audio authentication algorithms are mainly focused on either human speech or general audio with music as part of the test data, while special research on music authentication has been somewhat n...

متن کامل

Electrocardiogram Beat Classification using Probabilistic Neural Network

The Electrocardiogram (ECG) plays significant role in assessing patients with abnormal activity in their heart. ECG recordings of the patient taken to analyze abnormality and classify type of disorder present in the heart functionality. An Electrocardiogram is a bioelectrical signal that records the heart’s electrical activity versus time. It is used to measure the rate and regularity of heartb...

متن کامل

Service Classification Based on Improved BP Neural Network

With the development of the Internet, several candidate services have emerged for achieving the same task, most of which are functionally identical but different in non-functional properties. Therefore, these services can be classified into different service-quality levels. The so-called Quality of Service (QoS) comprises a set of non-functional properties that can be used to efficiently classi...

متن کامل

An Improved Fuzzy Neural Network for Solving Uncertainty in Pattern Classification and Identification

Dealing with uncertainty is one of the most critical problems in complicatedpattern recognition subjects. In this paper, we modify the structure of a useful UnsupervisedFuzzy Neural Network (UFNN) of Kwan and Cai, and compose a new FNN with 6 types offuzzy neurons and its associated self organizing supervised learning algorithm. Thisimproved five-layer feed forward Supervised Fuzzy Neural Netwo...

متن کامل

mortality forecasting based on lee-carter model

over the past decades a number of approaches have been applied for forecasting mortality. in 1992, a new method for long-run forecast of the level and age pattern of mortality was published by lee and carter. this method was welcomed by many authors so it was extended through a wider class of generalized, parametric and nonlinear model. this model represents one of the most influential recent d...

15 صفحه اول

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Advanced Computer Science and Applications

سال: 2023

ISSN: ['2158-107X', '2156-5570']

DOI: https://doi.org/10.14569/ijacsa.2023.0140706